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Analyze application-specific 
HTML DOM and define 
application-specific grammar 



Create yacc source file 
containing defined grammar 
together with C++ statements 
that generate user interface 
(UI) objects 





r 


Run yacc with source file to 
generate C++ source code file 
for the parser executable 
program 
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Compile yacc output file with 
additional C++ code modules, 
e.g., scanner code module 
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Resulting program accepts 
HTML DOM and produces a 
list of UI objects 
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Access HTML DOM in 
web browser and serialize 
the tree of DOM elements 
by traversing DOM 
hierarchy 



Create one or more tokens 
for each DOM element 
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Parse tokenized input 
according to application- 
specific grammar 
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Generate list of UI objects 
that correspond to images 
displayed in the browser 
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Input file containing 
HTML Grammar and 
C++ code outputting UI 
object descriptors 
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YACC 
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